Prominent effect of soil network heterogeneity on microbial invasion 
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Using a network representation for real soil samples and mathematical models for microbial spread, 
we show that the structural heterogeneity of the soil habitat may have a very significant influence 
on the size of microbial invasions of the soil pore space. In particular, neglecting the soil structural 
heterogeneity may lead to a substantial underestimation of microbial invasion. Such effects are 
explained in terms of a crucial interplay between heterogeneity in microbial spread and heterogeneity 
in the topology of soil networks. The main influence of network topology on invasion is linked to the 
existence of long channels in soil networks that may act as bridges for transmission of microorganisms 
between distant parts of soil. 



Understanding how the ubiquitous structural hetero- 
geneity of natural habitats affects the movement and spa- 
tial distribution of biota is an important and fascinating 
question relevant to several disciplines [1, 2]. In particu- 
lar, the soil pore space is a highly heterogeneous habitat 
hosting a stunning wealth of biological activity (e.g. of 
bacteria, fungi or nematodes [3-5]) that plays an essential 
role in many processes including plant growth [6] , climate 
change [7] , or soil-borne epidemics [8] . The study of the 
interplay between soil structural heterogeneity and mi- 
crobial activities in three dimensions (3D) is challenging 
due to the opacity of soil and the complexity of biological 
and environmental factors involved in microbial spread. 
Experiments based on soil thin sections [9-13] or planar 
microcosms [9, 14, 15] give some insight. For instance, it 
was observed that the volume of soil explored by fungi 
increases with the soil bulk density [12] and macropores 
may act as either preferential pathways or barriers for 
fungal spread [13]. However, due to the nature of the 
techniques, these type of experiments fail to provide in- 
formation in 3D so as to quantitatively assess the influ- 
ence of the structural heterogeneity and topology on mi- 
crobial invasion. Current understanding based on ecolog- 
ical and epidemiological models suggests that heterogene- 
ity in soil structure could either enhance or reduce the 
probabilities of invasions [16-19]. The outcome depends 
largely on the properties of the pore space, including the 
connectivity and pore sizes, and the effects these prop- 
erties have on microbial movement through soil. In this 
letter, we identify the main structural factors that affect 
invasion by devising several network models for biologi- 
cal invasion with increasing degree of interplay between 
microbial spread and the structure of the soil pore space 
(Table I). Our results and conclusions are not only rele- 
vant to biological invasion in soil but are also expected to 
be important for any biota moving in complex landscapes 
or generic agents spreading in networks with structurally 
complex links. 

We have analysed the invasion models for six real 
soil samples: three samples of soil without tillage treat- 



ment (denoted as Nl, N2, and N3) and three samples of 
ploughed soil (PI, P2, and P3). The main qualitative 
difference between N and P samples is that the pores 
are typically larger in the P samples (see Fig. 1(a) and 
the statistical analysis in [20]). For our theoretical anal- 
ysis, we have used a network representation of the soil 
pore structure derived from 3D-digital images of the soil 
samples scanned with an X-ray micro-tomography de- 
vice [20]. Soil networks consist of a set of nodes and 
edges whose layout captures the topology of the soil pore 
space where biological activity takes place. The network 
representation is achieved by associating the branching 
points of the soil pore space with the nodes and the 
pore-space channels between branching points with the 
network edges (see [20] for more detail and a compar- 
ison with previously proposed network representations 
for soil [21-24]). Fig. 1(b) shows the network for sam- 
ple Nl. All the networks have similar topological prop- 
erties irrespective of tillage treatment (Table II). The 
samples exhibit limited node degree, small topological 
heterogeneity, high clustering in comparison with ran- 
dom graphs, and fractal small-world behaviour (i.e. the 
mean separation between nodes increases as (I) <~ N v 
with 7] ~ 0.4, which contrasts with the slower increase 
(I) <~ lnA^ in standard small- wold networks [25]). Such 
features are typical for geographical networks embedded 
in Euclidean spaces [26, 27] and are thus not un-expected 
for soil networks that are embedded in a 3D space. The 
connectivity in this kind of networks is limited because 
each edge fills a certain space and thus the number of 
edges per node is restricted [20, 26, 27]. This property 
remains unaltered under tillage and this is likely to be 
the reason why the topologies are statistically similar for 
both N and P samples. We describe the structure of 
channels in terms of their arc-length, L, and local cross- 
section area S(x) (Fig. 1). Both the mean value (L) and 
the relative dispersion cr£/(L) 2 of the arc-length are sim- 
ilar for all the samples (Table II) . A relatively weak het- 
erogeneity in L, i.e. a^/(L} 2 < 1, contrasts with much 
greater variability in the cross-section area along chan- 
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(a) No tillage (Nl) Ploughed (PI) 




FIG. 1. Soil samples and soil networks, (a) 2D-sections of 
3D-digital images of soil samples Nl and PI (sample size: 
3.5 x 3.5 x 3.5 cm). The pore space and solid matrix are 
shown in white and grey colours, respectively, (b) Network 
representing the pore space of sample Nl. The colour of chan- 
nels indicates their local cross-section area, S(x), where x 
gives the position along channels. The zoom on the right il- 
lustrates the definition of the arc-length of channels, L, the 
node degree heterogeneity, and clustering (triangle 1-2-3) (cf. 
Table II). Both L and S(x) are dimensionless, i.e. scaled by 
the size of voxel in the 3D-digital image. 

nels, i.e. ct|/(S') 2 > 1. Large cross-sections are typical 
for long channels which are frequently attached to nodes 
with large k. Spatial correlations of S(x) are significant 
(i.e. narrow channels tend to be attached to narrow chan- 
nels and vice- versa) [20]. Here, we show that the hetero- 
geneity in S (x), the correlations between L and S(x) and 
spatial correlations for S(x) play a key role for microbial 
invasion. 

The spread of micro-organisms through a given pore 
space in soil is not a deterministic process but it occurs 
with certain probability [8, 9]. Inspired by epidemio- 
logical network models [17-19, 30-32], we assume that 
microorganisms reaching a node in the soil network are 
able to colonise any of the channels that emerge from 
that node and to reach uncolonised nodes with probabil- 
ity T (referred to as the transmissibility) . This quantity 
is central for all the models proposed in this work. Each 
model assumes a different form for T (Table I). Model 1 
corresponds to the simplest mean-field case with T being 
identical for all the channels. In model 2, T is indepen- 
dent of the structural properties of channels but it takes a 
random value for each channel (representing, e.g. a non- 
uniform spatial distribution of nutrient resources neces- 
sary for microbial activity [33]) that obeys a bi-modal 
distribution parameterised by the mean transmissibility, 
(T) . Model 3 suggests that T depends on the arc-length 
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FIG. 2. Invasion in soil networks. The vertical axis displays 
the density of colonisation, tic, in network Nl averaged over 
10 4 invasions starting from different randomly chosen nodes. 
The horizontal axis shows the mean transmissibility of mi- 
crobes, (T), obtained by averaging over channels. Different 
curves correspond to invasions predicted by different models 
(and exploration controlled by a in model 4). 



of channels (L) and the spatial scale of microbial coloni- 
sation is characterised by a typical exploration length, 
Ao- The value of transmissibility is assumed to decay 
with increasing L, meaning that microbial transmission 
through short channels is more likely than through longer 
channels. In Model 4, T depends on both L and the cross- 
section area along channels, S(x). The dependence on L 
is again controlled by the parameter Ao- Regarding S(%), 
we keep our description general so as to account both for 
microorganisms with preferential spread through pores 
with wide cross-sections and those that have a prefer- 
ence for narrow cross-sections. This preference may de- 
pend on a combination of biological and physical factors 
such as competitive exclusion from pore-size classes due 
to predator-prey interactions [1, 3, 16], or the spatial dis- 
tribution of water in soil [3, 9, 10, 15]. We capture these 
factors qualitatively with an effective area exploration pa- 
rameter, a, whose sign controls the preference for narrow 
(a > 0) or wide (a < 0) cross-sections. For a — 0, model 
4 reduces to model 3 (cf. expressions for T in Table I). 

We quantify the size of invasion by the density of 
colonisation, up, defined as the relative number of nodes 
reached by the microbial colony during an invasion that 
starts from a randomly chosen node in the network. Nu- 
merical simulations reveal that the mean value of nc 
predicted by model 1 increases with T but only takes 
significantly large values (e.g. nc > 0.1) if T > T c , 
where T c ~ 0.6 for all the analysed networks (Fig. 2 and 
[20] ) . Similar threshold behaviour for invasion is typically 
observed in some epidemiological models [17-19, 30-32]. 
The heterogeneity in T considered in model 2 does not in- 
troduce significant differences to nc which coincides with 
that for model 1 if the strength of microbial transmis- 
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FIG. 3. Bridging effect for edges in network Nl. The colour 
along the vertical direction gives the fraction of edges with 
given L (i.e. fixed value on the horizontal axis) whose de- 
gree of bridging is larger than the value B plotted along the 
vertical axis. For instance, the colour inside the circle (trian- 
gle) indicates that the fraction of channels of length L — 10 
(L — 50) with B > 0.9, i.e. l\, > 10, is approximately 0.1 
(0.7). By definition, B G [0,1] but only values of B > 0.7 
are shown to highlight the variation in B with L. Channels 
attached to border nodes of degree 1 are not included in the 
graph. See [20] for similar plots for other soil samples. 




FIG. 4. (a) Mean cross-section area of channels, {S)ht, (b) 
mean arc-length, (-L)ht and mean bridgeness, (B)ht, ob- 
tained by averaging over channels with high transmissibility 
(HT, T > (T)) in sample Nl for microbes with (T) = 0.5. 
(c) Plots of the channels with T > 0.5 coloured according to 
their value of L. 



sion is parametrised by (T) in both models. This result 
holds in general if the values of T for all the channels 
in the network are statistically independent from each 
other [34, 35]. For given (T), model 3 predicts coloni- 
sations of smaller size on average than that for model 1 
(the colonisation curve for model 3 is clearly below the 
curve for model 1 in Fig. 2). Therefore, heterogeneity in 
T induced by heterogeneity in L makes the soil network 
more resilient to microbial invasion [36]. Model 4 pre- 
dicts a similar behaviour for values of a near zero which 
is expected since models 4 and 3 coincide for a = (see, 
e.g. the invasion curve for a = 1 in Fig. 2). In con- 
trast, for larger values of the area exploration parameter 
(either a > or a < 0), invasions for given (T) can 
be more significant in model 4 than in any of the other 
models. The effect is especially pronounced for large neg- 
ative a (compare, e.g. the curve for a — —5 with that 
for homogeneous T in Fig. 2). This is a clear illustra- 
tion of the prominent effect of the strong heterogeneity 
in cross-sections of soil channels on biological invasion. 

The above results can be understood in terms of the 
intuitive idea that some channels act as bridges that link 
nodes that would otherwise be further apart or discon- 
nected. Accordingly, if some microorganisms are able to 
colonise channels with high "bridging-effects" , the result- 
ing invasions should be large. This behaviour is reminis- 
cent of the small-world effect [29, 34] and is related to 
the fractal small- world property of soil networks [25]. In 
order to quantify the bridge effect for every pair, u — v, of 
connected nodes we measure the range of the edge u — v 



[37] which gives the number of links, l^, along the short- 
est path from u to v, if the edge linking these two nodes 
is removed and define the bridgeness, B, as B = 1 — 1/lb- 
Channels with B close to unity indicate the presence of 
bridges for transmission because the shortest alternative 
path has a large number of steps, lb- Fig. 3 shows that 
B is heterogeneous but is correlated with the channel 
length, typically taking larger values for channels with 
long arc-length. This effect is ultimately responsible for 
the variations between invasion curves in Fig. 2 for dif- 
ferent models. 

In models 1 and 2 for invasion, T is not linked to the 
structural properties of channels and is thus independent 
of B. In contrast, model 3 assumes that T decays mono- 
tonically with L meaning that microorganisms are more 
likely to be transmitted through short channels which 
typically have small B (rather than through channels of 
any length as in models 1 and 2). This explains why the 
typical size of invasions for given (T) is smaller in model 3 
than predicted by models 1 and 2. In contrast, the trans- 
missibility in model 4 depends on S(x) (provided a^0) 
and does not decay monotonically with L as in model 
3 (see Sec. VI in [20]). As a result, some long chan- 
nels with high degrees of bridging are able to transmit 
microbes more efficiently than other with smaller L and 
B. Fig. 4 illustrates this idea for microbial invasion with 
given (T) = 0.5 by showing that the cross-section area 
of highly transmissible channels with T > (T) decreases 
with a (Fig. 4(a)) but their length and bridgeness show 
a minimum for a ~ (Fig. 4(b) and (c)). The larger val- 
ues of B observed for a < (i.e. when wide channels are 
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preferred) are due to the positive correlations between 
S(x) and L. In spite of these correlations, invasions with 
positive a (preference for narrow channels) may have a 
large value of B because S(x) is very heterogeneous and 
there are channels that are both narrow and long [20]. 
The invasions corresponding to a ^ are then typically 
larger than predicted by models 1-3 (Fig. 2). The effect 
is particularly important for a < because wide chan- 
nels (preferred for a < 0) tend to be longer than narrow 
channels. Similarly, the significant spatial correlations in 
S favour invasion in model 4 (see [20]). 

According to our analysis, the global traits of inva- 
sion are mainly dictated by generic characteristics such as 
the heterogeneity in the network (i.e. in channel length 
and degree of bridging) and transmission. An impor- 
tant conclusion which applies to invasions in any hetero- 
geneous landscape is that a limited parametrisation of 
microbial transmission (e.g. in terms of (T)) combined 
with an insufficient description of structural heterogene- 
ity in models (e.g. as in models 1-3) may lead to serious 
underestimates of the size of invasions. This highlights 
the importance of capturing (i) the essential features of 
structural heterogeneity that affect microbial spread and 
(ii) the appropriate parametrisation of microbial trans- 
mission which takes into account the effects of structural 
heterogeneity. Our results suggest that describing mi- 
crobial transmission in terms of the parameters Ao and a 
(Model 4) is more appropriate than using just (T). In- 
deed, A and a give a better description of the effects of 
channel structure on microbial spread and reveal signifi- 
cant differences in invasion for ploughed and unploughed 
soil (microorganisms with preference for wide channels 
invade more in ploughed soil, and vice versa [20]). 

To conclude, our work demonstrates that the shape, 
size, and interconnection of pores in addition to other 
characteristics influencing the value of the area explo- 
ration parameter a (e.g. type of microorganisms) are the 
key factors determining the extend of microbial invasion 
in soil. The interplay between heterogeneity in micro- 
bial transmission and heterogeneity in the topology of 
soil networks plays a crucial role for biological invasions 
in soil. 
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TABLE I. Models for microbial spread. T is the trans- 
missibility. L (S) is the arc-length (cross-section) of chan- 
nels. Ao (a) is the length (area) exploration parameter. 
S a = L^ 1 fy[S(x)] a dx is the average of S(x) a along chan- 
nels. Derivations of T for models 3 and 4 are presented in 

m 

Model Heterogeneity Parameters Transmissibility 

1 None 7 

2 Non-structural (T) 

3 L Ao 

4 L, S(x) A , a 



T 

0, with Prob 1 - (T) 

1, with Prob (T) 
exp(— L/Ao) 

exp(— L S a /Xo) 



TABLE II. Network topological characteristics and channel properties for six soil samples. TV and E denote the number of 
nodes and the number of edges in networks, respectively, (k) is the mean degree and ai/(k) 2 = (k 2 ) / (k) 2 — 1 is a measure of 
the topological heterogeneity [28] . The clustering C is given relative to C ran d = (k) /N for a random graph with the same value 
of TV and (k) [29]. The mean separation length, (I), gives the typical separation between two nodes in the network [29]. (L) 
and ct£/(L) 2 are the mean value and dispersion of L. Analogous quantities for cross-section area S are given in the last two 
columns. 
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Nl 


49709 


69563 


2.80 


0.165 


763.7 


73.05 


6.47 


0.362 


12.03 


2.51 


N2 


58618 


82949 


2.83 


0.180 


1000.4 


74.67 


5.94 


0.348 


11.53 


6.66 


N3 


54083 


76747 


2.84 


0.165 


848.0 


64.26 


6.53 


0.382 


22.80 


30.60 


PI 


33526 


45544 


2.72 


0.162 


488.2 


69.61 


6.85 


0.401 


19.18 


3.23 


P2 


47388 


66147 


2.79 


0.156 


667.6 


66.65 


6.67 


0.366 


17.33 


5.27 


P3 


27042 


36125 


2.67 


0.165 


368.3 


70.73 


7.15 


0.450 


21.31 


4.00 



